Enhancing Recall in Information Extraction through Ontological Semantics

نویسندگان

  • Sergei Nirenburg
  • Marjorie McShane
  • George W. Bush
چکیده

We proceed from the assumption that extracting and representing the meanings of texts that serve as sources for information extraction will enhance the latter’s quality. In particular, we believe that resolving reference in these texts will lead to higher levels of recall in IE because additional information will become available for extraction once it can be captured not simply by matching character strings in the IE template but by knowing that George W. Bush, President Bush, the current president of the US, the leader of the free world, and the winner of the 2000 National election all refer to the same entity and, therefore, whatever information in the text is introduced by any of the above (and other reference means, notably, pronominalization and ellipsis) is relevant.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ontological Smoothing for Relation Extraction

There is increasing interest in relation extraction, methods that convert natural language text into structured knowledge. The most successful techniques use supervised machine learning to generate extractors from sentences which have been labeled with the arguments of the relations of interest. Unfortunately, these methods require hundreds or thousands of training examples, which are expensive...

متن کامل

Enriching OWL with Instance Recognition Semantics for Automated Semantic Annotation

Although OWL provides a solid basis for many semantic-web applications, it lacks sufficient declarative semantics for instance recognition. This omission prevents OWL from being a satisfactory ontology language for automated semantic annotation. We can resolve this problem by adding to ontological OWL declaration epistemological instance recognition semantics that include external representatio...

متن کامل

Evaluating Evaluation Metrics for Ontology-Based Applications: Infinite Reflection

In this paper, we discuss methods of measuring the performance of ontology-based information extraction systems. We focus particularly on the Balanced Distance Metric (BDM), a new metric we have proposed which aims to take into account the more flexible nature of ontologically-based applications. We first examine why traditional Precision and Recall metrics, as used for flat information extract...

متن کامل

High Level Multimodal Fusion and Semantics Extraction

This thesis on multimodal fusion and semantics extraction, focuses on automated detection and annotation of harmful content in video data. The aim is not only to reason out the existence of violence or not (i.e. the binary problem), but also to determine the type of violence (e.g. fight, explosion, murder). Acknowledging the lack of knowledge representation and reasoning approaches for the prob...

متن کامل

DBpedia based Ontological Concepts Driven Information Extraction from Unstructured Text

In this paper a knowledge base concept driven named entity recognition (NER) approach is presented. The technique is used for information extraction from news articles and linking it with background concepts in knowledge base. The work specifically focuses on extracting entity mentions from unstructured articles. The extraction of entity mentions from articles is based on the existing concepts ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003